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Abstract 



In this work, we identify the most general measure of arbitrage for any market model governed 
by Ito processes. We show that our arbitrage measure is invariant under changes of numeraire 
and equivalent probability. Moreover, such measure has a geometrical interpretation as a 
gauge connection. The connection has zero curvature if and only if there is no arbitrage. We 
prove an extension of the Martingale pricing theorem in the case of arbitrage. In our case, the 
present value of any traded asset is given by the expectation of future cash-flows discounted 
by a line integral of the gauge connection. We develop simple strategies to measure arbitrage 
using both simulated and real market data. We find that, within our limited data sample, 
the market is efficient at time horizons of one day or longer. However, we provide strong 
evidence for non-zero arbitrage in high frequency intraday data. Such events seem to have 
a decay time of the order of one minute. 



1 Introduction 



The no-arbitrage principle is the cornerstone of modern financial mathematics. Put it simply, 
an arbitrage opportunity allows an agent to make a non-risky profit with zero or negative net 
investment (see |DeSc08j ). Under the no-arbitrage assumption^] one can assign in a complete 
market a unique price to the derivative of any traded assets using the replicating portfolio 
method (see |MuRu07] or [CvZa04] ). The no-arbitrage principle can also be shown to be 
equivalent to a weaker form of economic equilibrium (cf. |CvZa04j ) and can therefore be 
seen as a form of market efficiency (see |Fam98] . [Ma03] ). It is then not surprising that most 
financial and economic literature is based on the no-arbitrage assumption. 

Nevertheless, the no-arbitrage principle represents a very strong assumption about market 
dynamics which must be tested empirically. Even when the market participants use no- 
arbitrage models, the ultimate price of any security which is traded in a centralized market 
is set by supply/demand and the complex dynamics of the order book. That is, "the market" 
sets the prices. It then makes sense to ask: how efficient are these final prices? In order to 
answer this question one needs a measure of arbitrage. 

There is a large body of empirical studies on financial arbitrage. Most of these studies 
focus on measuring the "excess return" of particular trading strategies (see e.g. [JeTi 93j. 
|GaGoRo 06j). Other studies try to find violations to general no- arbitrage relations between 
option prices (see e.g. |AcTi99j ). However, there does not seem to be a consensus on whether 
the reported market "anomalies" are due to arbitrage, or simply to random fluctuations (see 
|Ma03] . [Fam98] ) . Part of the problem is that there seems to be no general measure of 
arbitrage which can be applied to any traded asset. One of the main goals of this paper is 
to define such measure. 

The second goal of this paper is to provide a geometrical interpretation of the arbitrage 
measure. In particular, it has been speculated long ago by Ilinski jI197j . [1101] and Young 
|Y099j that arbitrage should be viewed as the "curvature" of a gauge connection, in analogy 
to some physical theories. The fact that gauge theories are the natural language to describe 
economics was first proposed by Malaney and Weinstein in the context of the economic index 
problem [Ma96j . [We06j . The need for such mathematical language can easily be seen from 
the fact that prices are only relational. More precisely, let X(t) = (Xi(t), ^(i), . . -Y be the 
price vector of all goods in the economy at time t, in some common unit (say USD). Since 
the measuring units are arbitrary, fundamental economic laws must be invariant under the 

1 In continuous time the no- arbitrage condition (NA) has to be sharpened to the no- free-lunch- with- 
vanishing-risk condition (NFLVR) . Only under the (NFLVR) the fundamental theorem of asset pricing can 
be proved, see [DeSc08j . Therefore, in this paper, when referring to (NA) in continuous time, (NFLVR) is 
meant. 
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transformations 

X(t) - A(t)X(t) , (1) 

where A(t) > is a positive stochastic factoJE In physics, a (local) transformation such as 
Eq. ([T]) is known as a gauge transformation. These represents a redundancy of our description 
of the economic system. The laws of the economy should be gauge invariant. The need for 
a gauge theoretical approach to economics was highlighted recently in [Sm09j . The role of 
gauge invariance in option pricing has been studied in [HoNe99aj . |HoNe99b] . [HoNeOO] and 
[HoNeVeOl] . For an unrelated use of differential geometric methods in (no-arbitrage) option 
pricing see [La08j . A recent proposal for a gauge connection in finance, and its relation to 
arbitrage, was presented in |Far08j . [Far09aj and [Far09b] . In fact, we will show that the 
curvature of the gauge connection proposed in [Far08] . |Far09aj and [Far09b] is equivalent 
to our arbitrage measure. 

In physics, curvature is a gauge invariant measure of the path dependency of some phys- 
ical process. For example, readers familiar with electrodynamics might recall the vector 
potential A^, where \x = 0,1,2,3 label the space-time directions. In differential geome- 
try for theoretical physics, A^ is known as a gauge connection. Now consider a charged 
particle which is traveling along some trajectory in space-time x M (s), s G [0,1]. The 
interaction of this particle with the gauge potential is proportional to the line integral, 
f A := J Q A fl (x(s))x ,M (s)ds. Now suppose that we make an infinitesimal change in the 
path of the particle <5x M (s), keeping the boundary conditions fixed: 5x^(0) = 5x u (l) = 0. 
The interaction changes by 5 y A j = v H§ ^x fJ, (s)x u (s)F flu , where F^ u = d^A u — d u A^ is 
known as the curvature of A. Therefore, we see that for zero curvature F^ v = 0, the line in- 
tegral f A is independent of the path 7. Moreover, note that the curvature is invariant under 
a gauge transformation of the form A^ — > A^ + <9 M A, where A is any function of space-time. 

We will find a very similar construction in the case of mathematical finance. In particular, 
we will show that the arbitrage curvature defined in this paper measures the path dependency 
of the present value of a self-financing portfolio of traded assets with fixed final payoff. The 
no-arbitrage principle is then equivalent to a zero-curvature condition. In analogy with the 
electromagnetic curvature F, we expect that any measure of arbitrage should be invariant 
under the gauge transformation in Eq. ([IT) . Moreover, the fundamental theorem of asset 
pricing states that the no-arbitrage principle is equivalent to the existence of a probability 
measure with respect to which asset prices expressed in terms of a numeraire are Martingales 
[MuRu07j . Therefore, we expect that any measure of arbitrage should also be invariant 
under a change of probability. These are, in fact, two very important properties that will 
characterize our arbitrage measure. 

2 For example, A(t) can be the EUR/USD exchange rate. 
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This paper takes a "macroscopic" or phenomenological approach to arbitrage. More 
precisely, we will study arbitrage from the point of view of general stochastic models. We 
do not address the question of what is causing the arbitrage. Our main goal is to identify 
the gauge invariant financial observables that indicate an arbitrage opportunity. We believe 
that a proper understanding of such quantity, including its geometrical interpretation, is a 
first step towards a theory of non-equilibrium economics. Moreover, our methods can be 
applied to the construction of profitable trading strategies. Our main assumption is that the 
prices of all financial instruments can be described by Ito processes. Moreover, we ignore 
transaction costg§. 

The organization and main results of the paper are the following. In section 2 we present 
the class of models that we use in the rest of the paper. They are very general and include 
the case of stocks, bonds and commodities, and more complicated derivative products. We 
decompose the dynamics of these models in terms of their gauge transformation properties 
with respect to Eq. (CQ). We identify the gauge invariants and show that they represent an 
obstruction to the existence of a Martingale probability measure. We conclude section 2 with 
an example with three assets, and we derive a modified non-linear Black Scholes equation 
with arbitrage. 

In section 3, we give a geometrical interpretation to the gauge invariant quantities defined 
in section 2. Our main goal is to identify the stochastic gauge invariants of section 2 with 
the curvature of a gauge connection. We begin with a review of the Malaney-Weinstein 
connection |Ma96] . [We06j . which is done in the context of differentiable economic paths. In 
section 3.1, we generalize the Malaney-Weinstein construction to stochastic processes and 
prove an asset pricing theorem. The main result of this section is that the present value of 
any self-financing portfolio of traded assets is given by the conditional expectation of future 
cash-flows, discounted by a line integral of the Malaney-Weinstein gauge connection. We 
show how the value functions of different portfolios replicating the same contingent claim 
are related to the arbitrage curvature. Finally, we show that the gauge connection recently 
proposed in [Far08] . [Far09aj and |Far09bj . is equivalent to the present construction. Readers 
interested in only the arbitrage measure and the detection techniques, can skip section 3. 
None of the results of section 4 require an understanding of the geometry of arbitrage. In 
section 4 we develop a simple algorithm to measure the arbitrage curvature using financial 
data. We explain the main sources of error in such measurement. The algorithm is applicable 
to any financial instrument. We provide examples with financial data of stock indexes and 
index futures. We find that, on long time scales, the market is very efficient. However, we 

3 Note that, as pointed out in |ShVi97| . many possible arbitrage opportunities disappear once one takes 
into account market frictions such as transaction costs. Therefore, it is important to keep in mind that even 
when we measure a non-trivial curvature in the market, it does not mean that it can always be exploited in 
a practical trading strategy. 
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provide strong evidence for non-zero curvature fluctuations at short time scales in the order 
of one minute. We conclude in section 5. 



2 Stochastic Models and Gauge Invariance 

Let Ai = {0, 1, 2, . . . , N — 1} be the set of all traded securities at any point in time in the 
market. We will use greek indices /x, u, . . . to label members of the set Ai. We denote the 
price of security /x G Ai by X^. Our main assumption is that the dynamics of all such 
securities is described by Ito processes of the form [So06j and [ShOOj: 



dX^ := X M (a„dt + tf dW ^J > V/x G Ai 



(2) 



where W a are standard Brownian motion such that W a (t) — W a (0) are independently and 
normally distributed random variables with, 

E [W a (t) - W a (0)\ = , Cov [W a (t) - W a (0), W b (t) - W b (0)} = td ab . (3) 

We make no assumptions about the number of Brownian terms, and hence the complete- 
ness of the market. The set of Brownian motions {W a } represent all the randomness in the 
market. Therefore, they induce a natural filtration T = (^i)t>o f° r our probability space. 
The coefficients and a a ^ can also be stochastic processes adapted^] to the filtration T . 
However, they are assumed to satisfy suitable conditions to ensure the existence of the price 
processes X^ (see |LaLa06j ). This class of models is very general and includes stocks, bonds, 
options, etc. Moreover, the case of fat tails in the distribution of returns is also included, 
since this is known to be generated by stochastic volatilities cr" 

Looking back at Eq. (j2J), we can see that the tangent space dX^ has a natural decomposi- 
tion into the directions which contain all the randomness (J^ a a^dW a ) and those orthogonal 
to it. Therefore, we will make the following decomposition of the drift term in Eq. (j2J): 



2 + ^/3^+ Ys aAj t > 



(4) 



4 In simple terms, a process p adapted to T means that it does not depend on future values of the 
Brownian motion. In other words, p(t) can only depend on {W n (s)} up to time s <t. 
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where M is the space spanned by basis vectors J A := [J A , . . . , Jfi-iV such that 

£ j m% b = ^> £# = o, E j >m = ' v «> (5) 



/' 



and 



= E J X ^:=^-o«, ^:=^E<- ( 6 ) 



MM M 



We will refer to M as the rra/Z space of the market. Note that this space is orthogonal to 
all the randomness in the tangent space dX^. However, we need to remember that M might 
be trivial. The definition of (3 a in Eq. (j3J) is not unique if the vectors <r^ := [(Xq, . . . , <3jv-i]^ 
are linearly dependent. This is the case of, for example, an incomplete market with more 
Brownian motions than traded securities. Moreover, a A is unique up to rotations in the null 
space. As we will see, the quantities a A are the unique gauge invariant measures of arbitrage. 
The two main goals of this paper are to give a geometric interpretation to the parameters 
a A , and to set up a procedure to measure them using financial data. 

Since prices are relative and only reflect an exchange rate between two products, the 
units used to measure are arbitrary. Therefore, the dynamics of the market must be 
invariant under a change of measuring units. In mathematical finance, this is known as a 
change of numeraire |MuRu07j . and it can be interpreted as a gauge transformation, 

X,(t) -> A(t)X,(t) , (7) 

where A is a positive stochastic process which is adapted to the filtration T . Another 
symmetry, which is special to the particular models of Eq. (T2J), is a transformation of the 
probability measure. This is not really a gauge symmetry, but corresponds rather to a change 
of variables of the form W a (t) -> W a (t) + f* 5(3 a (s)ds. 

Our next task is to study the transformation properties of the different terms in Eqs. (121) 
and (J4]). The following result follows. 

Proposition 2.1: Consider a change of numeraire of the form X^ — > AX^, where A is 
a positive stochastic process adapted to the filtration T , and dA := A(5adt + ^2 a Sa a dW a ). 
Then, the coefficients of the ltd processes, Eqs. and (fj]j, transform as 



a — ► a 



+ 5a + o~ a $o~ a , P a -> (3 a + 6a a , o a -> a a + 5a a . 



Finally, under a transformation of the probability measure given by the Radon- Nykodym 
derivative dP/dP* = exp [ — J2 a 5P a dW a — | f ^2 a (Sf3 a (s)) 2 ds) , we have the mapping of 



5 



standard Brownian motions 



and 



W a {t) Wait) + J 5f3 a is)ds 



(9) 



(10) 



In particular, it follows that a a , a and J A are invariant under such transformations. 



Proof: The result in Eq. 
product X' := AX^. 



above follow from a simple application of Ito rule to the 



dAX^ + AdX^ + d(A, Xfj) 



I a + 5a + aa5(ja + + 8(ja ) a l + aAj t J dt 

\ a a AeN / 



'ir, 



where d(A, X M ) = dtAX^ ^ a 5<7 a cr^ is the differential of the quadratic variation. The trans- 
formation in Eq. (jTOl) follows from a simple differentiation of W a in Eq. (l2l): 



a a A<=N / 

Y,K+° a )dw: 



(12) 



where we defined W a (t) = W*{t) + J 5(3 a is)ds. Note that both a A and J A are unchanged 
by these gauge transformations. In particular, suppose that £^ J A al = 0. Then, it follows 
from Eq. © that £ M J A + 5a a ) = £ M ^ = 0- 
□ 

So far we have taken the existence of the basis vectors J A for granted. A constructive 
procedure to find such basis, if non-trivial, is given by the following proposition. 

Proposition 2.2: Let Q be the symmetric and real N x N matrix with component VL^ U = 
^2a a ?i a v' w here N = dim(Jii). Moreover, define U as the matrix of all ones, e.g. = 1, 
V/i, v G M.. Then, the matrix G defined by 



G = Q - — iUQ + QU) + TriUQ)U 



(13) 
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is gauge invariant. Let Mg be the null space of matrix G such that ^ J M = for any non- 
trivial J G Mg- Then Mg = M . In particular, the space Mg is spanned by the orthonormal 
zero-modes of G which are orthogonal to the vector J = (1,1, ... ,1)' . 

Proof: First we need to prove that the space of vectors J such that jV a = 0, Va, is in 
one-to-one correspondence with the zero modes of Q: QJ = 0. Obviously, if JV a = 0, it 
follows that J is also a zero-mode of Q. To prove the converse, suppose that QJ = 0, but 
J'a a = X a , where X a ^ for at least one value of a. Then, = J^QJ = Xla(^ a ) 2 ' which can 
only be true if A a = Va. 

Now we turn our attention to the matrix G, defined in Eq. (1131) . Using the gauge 
transformation er" — > a?, + 5a a , we can see that Q transforms as 

a a 

It is then straightforward to verify the gauge invariance of the matrix G. Next we recall that 
the space M is spanned by (non-trivial) orthonormal zero-modes of Q such that they also 
satisfy ^ = 0. One can define a similar space Mg for G. It is easy to verify that any 
vector J G M is also a vector in Mg- On the other hand, for any vector J' G Mg, it follows 
from Eq. ( fl3l) that QJ' = 0. Thus, we have proven that M = Me- 
lt is easy to verify that G^ w = 0. Therefore, the vector J — (l,l,...,l)^isa particular 
zero-mode of G. Now take any other zero-mode of G, call it J', which is orthogonal to J. It 
follows that, = J* J' — ^2 J'^. Therefore, J' G Mg = M. This completes the proof. 

□ 

So far we have talked about the full set of securities of the market. However, it is clear 
that the decomposition in Eq. (j3J) can be done for any subset of the market. That is, suppose 
we observe a subset of the prices Xi, i G S C Ml. Moreover, suppose we find that within this 
subset one still can find some zero-modes J A obeying, ^ J fa? = 0, Va and Jf = 0. One 
can then easily lift these vectors to the full set Ml by taking Jj^ = (J A , 0). This represents 
a particular choice of basis in the null space M. By observing a sub-sector of the market, we 
will only have access to some of the components of a A . For notational convenience, we will 
not distinguish between the full market and a subset of it in what follows. 

Under the no- arbitrage assumption (see |MuRu07] . |CvZa04j ). it is always possible to find 
a common positive discount factor A and an equivalent probability measure P* ~ P such that 
the discounted prices AX^ are martingales: A(t)X^(t) = E*[A(T)X M (T)], where t < T. This 
is known as the Martingale representation theorem (see |So06] and [ShOOj ). In our language, 
this means that there is a gauge transformation mapping X^ to P*-Martingales. In other 
words, if there is no arbitrage, price processes are gauge-equivalent to P*-Martingales for 
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some probability measure P*. The result of the Martingale representation theorem can only 
be obtained if one is able to write // d(AX^) := // %dW* a for some adapted process 7". 



The reason is that the stochastic integral J t ^dW* is a Martingale: E£ J t ^dW* 



0. 



By proposition 2.2 there is neither a change of probability nor a choice of a positive discount 
factor for which the vector Y^AeN aA J A is mapped to (in contrast to a and all /3 a s which can 
indeed be made vanish). Therefore, it is easy to see that the term ^2iA&M aA ^ A parametrizes 
the obstruction to the existence of a Martingale probability measure for any discounted price 
process AX^. 

As a A s are gauge invariant quantities, one expects that they should be observables. In 
the next section we will relate this quantity to a gauge connection and its curvature. In 
section 4 we will show that such quantity can indeed be observed, and we explain simple 
strategies to measure it. Before concluding this section, it is instructive to a study particular 
example with three assets. 



2.1 An Example 

Consider the case of three assets X^, /1 = 0, 1, 2, where Xq is a savings account and Xi, X2 
are some other risky assets. All prices are measured in the same common units. We will 
assume only one Brownian motion. Therefore, the dynamics of the prices is described by 



dX = rX dt , 

dXi = Xi [audi + (JidW] 



1,2 . 



(15) 



For later convenience, we assume that the interest rate r is deterministic. In order to do the 
decomposition in Eq. (jlj) we need to find a basis for the null space N '. In this case, since 
there is only one Brownian motion and two risky assets, there will be only one null direction. 
To calculate it, we start by identifying the Q matrix: 



n 



(° 





\ 





°\ 




V> 




°l ) 


usin 


g Eq. 





(16) 



illuminating. The unormalized eigenvectors of G are found to be 

/-l 



(2 - 

CT1+C2 



Vi 



\ 








Vo 



3(71 \ 



V 



1 





) 



1 01+0-2 \ 

CTl— 202 
02— 2oi 
(J 1 — 2o2 

V 1 / 



(17) 



8 



where 



GV 1 = 0, GV 2 = 0, GV 3 = -(o* + o*-a 1 a 3 )V 3 



In order to find a basis for the null space M defined in Eq. (JSJ), we need to project V\ or V 2 
into the space orthogonal to the vector J° = (1,1,... l)t. To do this, we define the projection 
matrix 



1 



u 



U, P t 



u — Pu 



(19) 



where U is the 3x3 all-ones matrix. Note that 1 — Pu projects into the space orthogonal 
to J°. Our choice for the normalized null vector is then 



J 



1 



2a/0-J +0-3- 0-1(72 



( G\ — <7 2 \ 



(20) 



It is easy to verify that J obeys the properties given in Eq. (jSJ). 

We can now go back to the decomposition given in Eq. (Jl]). Using Eqs. (fl~5{) . we find 



a = r — /3<To — a Jq , cr = 



(21) 



where = — | Yll=o a v an< ^ ^ ^ s ^ ne arbitrage vector a A , which in this case has only 
one component a 1 := a. Therefore, inserting Eqs. ([21]) into Eqs. (fT5|) . we can write the 
evolution equations as 



dX 
dX l 

dXo 




2cr 2 — a 1 



2 \/ (T i + °2 - °"1°"2 

(72 — 2(7! 
2 V / °"l + °2 - 




(22) 
(23) 

(24) 



For a = 0, Eqs. (|22|) - (!24j) reduce to the familiar no- arbitrage Black-Scholes dynamics. As 
usual, (3 is interpreted as the market price of risk. Note that in this example, both risky 
assets are exposed to the same market risk factor W. The volatility a% measures the coupling 
to such risk. Under the no-arbitrage assumption, both assets should give the same expected 
return per unit of risk. This is (3. However, we see that if a 7^ 0, X\ and X 2 have different 
expected returns, even when they are exposed to the same risk. This discloses an arbitrage 
opportunity. 

There is a very interesting consequence of Eqs. ( 1221) - ( |24|) when X 2 is any function of 
X±, e.g. an option. For simplicity, consider the case where the only time dependence in at is 
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of the form a = a(t,Xi), where a(t,Xi) is a different iable function of t and X\. Moreover, 
the interest rate r is assumed to be deterministic. In this case one can derive a non-linear 
version of the Black Scholes equation with arbitrage. For ease of notation let X\ := X. 
Under our assumptions we will have that X2 = V(t,X). Then, using Ito rule we find 



dV 



d t Vdt + d x VdX + -d x Vd(X) 
V (a 2 dt + a 2 dW) , 



(25) 



where we identify 



ot 2 = — — I- ot\X — ^ h ~cr 1 X — — , cr 2 = 0\X - 



V ' V 2 
Comparing Eq. (f26l) with Eq. (|24j) we find 

cr 2 - 2(Ji 



(26) 



«2 = r + Pa 2 + 5 
where from Eq. (T231) 



— ^ — ■ = h OiA (J, A 

+ a 2 _ ^ 1 V 2 1 V 



= r + /Jo"! + a 



2cr 2 - cxi 



(27) 



(2J 



'2-^/^ + cr| — cri<7 2 

Therefore, after some algebra Eq. fl2T|) becomes a modified non-linear Black Scholes partial 
differential equation: 



d t V + rXdxV + \y\X 2 d 2 x V 



V2. 



r 



r 



1/2 



-r y = 



(29) 



Note that for a = this reduces to the familiar Black Scholes equation. The non-linear 
Black Scholes equation is a special case of the more general pricing theorem presented in 
section 3. 

It is important to remember that the arbitrage parameter, a in Eq. (1291) . can in general 
depend on time and the stock price. Therefore, in principle almost any deformation of the 
option price is possible. It follows that the Eq. (1291) can be solved only if the arbitrage 
dynamics is known. For example, consider the case where we set 



a :- 



2 3/2 



0"i 



X 2 d 2 x V 



(30) 



for some constant a(. Then, the option price obeys the usual Black-Scholes equation but 
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with the "wrong" volatility: 

d t V + r {Xd x V -V) + -alX 2 d 2 x V = . (31) 
This is a simple example of the well-known volatility arbitrage. 

3 The Gauge Connection 

The application of differential geometric ideas in economics can be traced to the work of 
Malaney and Weinstein ( |Ma96j . |We06j ). It was found that the solution to the apparent 
discrepancy among different economic growth indices could be solved by the appropriate 
choice of a covariant derivative. Such derivative has the property that a self-financing basket 
of goods is seen as "constant". More technically, a self-financing basket is interpreted as 
being "parallel transported" along a one dimensional curve in the a base manifold spanned 
by prices and portfolio nominals. Then, there is a natural geometric index to measure the 
growth of such basket, which was shown to be identical to the so-called Divisa Index. It is 
very illuminating to review this construction to gain intuition about the relation between 
arbitrage and curvature. In what follows, all quantities are assumed to be deterministic and 
different iable. We will return to the stochastic case in the next subsection. 

A covariant derivative induces a connection one- form in the base space, and in [Ma96j. 
|We06j this connection is given by 

where are the portofolio nominals, V = ^^^^X^, and the base space is parametrized 
by the coordinates (t, X^). Note that under a change of numeraire X^ — ► A^)^, the 
connection transforms as 

A -> A + dA . (33) 

This is the analog of the transformation rule of the vector potential in electrodynamics. 

A self-financing portfolio can be seen as being parallel transported with the connection 
A as 

V^= (d-A)V\.=0, (34) 

where is the covariant derivative along the trajectory 7. The solution to this equation is 
simply, 

V(T) 

e J ~< 



WH"^- (35) 
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where 7 is a particular self-financing trajectory (s,<fr(s),X(s)) , s G [t, T], and -D 7 is known 
as the Divisa Index. 

The dependence of -D 7 on the choice of curve 7 is parametrized by the curvature of the 
gauge connection, which is given by 

R = dA = 2 ^ {<t> v X v d<t> a A dX a - (peXyd^v A dX a ) . (36) 



co- 



Note that the curvature is invariant under a gauge transformation, as d(A + dA) = dA. 
In the approximation where economic agents are price takers, the price trajectory X(t) is 
given exogenously, and we are only allowed to make changes in the portfolio nominals <fi. In 
other words, we can write dX^ = X^dt in Eq. (|36|) . One can then restrict the curvature 
to the submanifold corresponding to the (t, 0^) coordinates. The induced curvature in this 
submanifold is given by 

R = V <\> a X v X a l^-^L]d(f)„Adt:=J2 Rrtdfa A dt . (37) 

In this case, the path dependency of the Divisa Index, Eq. fl35|) . can be written as, 

5 7 logL> 7 = V'/ dsR^ t (s)5(j)^s) , (38) 

where 5 7 represents a variation to the trajectory of the portfolio nominals. Therefore, we 
see that Eq. (1351) is independent on the path 7 only if the price trajectories obey the zero- 
curvature condition R^ t = =^> Xp(t) = a{t)X^(t), V/x. The zero-curvature condition 
implies that the prices of all securities evolve by the same common inflation factor. 

The relation between curvature and arbitrage goes as follow. Suppose that the prices 
obeyed the zero-curvature condition given above. It follows that, for any self-financing 
portfolio, we have 

V(T) = V{t)e$~< A = V(t)e^ ds , (39) 

for T > t. In particular, if V(t) = it follows that V(T) = 0. Therefore, it is not possible 
to make wealth without a positive initial investment. On the other hand, suppose that the 
curvature is not zero. Consider two portfolio trajectories 71 and 72 such that, say D 7l > D l2 
at some time T > t, for the same initial wealth V 71 (t) = V 12 (t) > 0. Now construct the 
difference portfolio with nominals <fi := — 02 and wealth function 

V = V 71 - V l2 . (40) 
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Then, at time T > t we have 

V(T) = (D J1 -D 12 )V J1 (t)>0, (41) 

while V(t) = 0. In other words, we have made wealth out of nothing. In the next section we 
show how this construction carries over to the stochastic case. 



3.1 The Stochastic Gauge Connection 

In the previous section we illustrated the relation between curvature, path dependency and 
arbitrage, using the Malaney-Weinstein connection. However, this construction only works 
for differentiable economic trajectories in the base space ((f), X). Nevertheless, we have found 
a direct analog of the Malaney-Weinstein connection for Ito processes, which we summarize 
in the following theorem. In order to avoid technical complications, we restrict our attention 
to an economy on a finite interval of time t G [0, T]. 

Theorem 3.1: Consider any self -financing portfolio V = ^) (p^X^, so that dV = (p^dX^. 
Then, there exist a (non-unique) equivalent probability measure P* ~ P under which the price 
processes obey 



dX^ — X M 



(42) 



Moreover, the present value of V(t) given some final payoff V(T), T > t, is given by 



VJt) = E* \v(T)e- J -< T 



(43) 



where 7 is some self-financing trajectory, and T is given by the expectation of the Malaney- 
Weinstein connection, 



r = e? 



dt + a*dt . 



(44) 



Finally, the path dependency of the present value of the portfolio, with fixed final payoff, is 
parametrized by 



Wy{t) = ~J2 dsE * \v{T)e-^ T 5<t>^s)R^ t {s 



(45) 



where Rn t are the components of the curvature two-form defined in the reduced base space 
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R = dT 



a A X u X a( f) a ( J* - J*) d(j> v A dt := ^ fl„,t<% A dt . (46) 



Proof: We start by writing the portfolio return as 



where 



dV = ^ dX n ■= v ( adt + Yl 6<Wa ) 

/Li \ a / 



E M A ,„ _ E M ^0 A 



Now consider the combination V := AV, where we take (of. Eq. (J2J)) 



dA = A 



A simple application of Ito rule gives 



^b a (3 a J dt-J2(3 a dW a 

a / a 



dV' = V'Y,(b a ~ (3 a )dW a 



(47) 



(Ah 



(49) 



(50) 



It is well known that any stochastic integral of the form J Q ^dW a is a Martingale |So06] and 
|Sh00] . Therefore, we have 

V(t) =Et \v{T)e^ dlosA \ . (51) 
A further application of Ito rule gives 

dlog A = -r - 1 J2(P a ) 2 dt - J]« , (52) 

a a 

where T is defined in Eq. (jBj), with a* := a — X]a/^ a °" a - 

Now consider making a change of probability measure such that W a := W* - J f3 a (s)ds. 
It is easy to see that, under P*, the price processes will obey Eq. (1421) of the theorem. 
Moreover, the Radon-Nykodym derivative is given by 



dP 

dP* 



exp 



(53) 
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Therefore, using Eqs. 



and (1551) in floTj) we get 



V(t) 



E* 

e; 



V(T)e 



- L r-| E a / t T (^) 2 d*-E if /?w a 



V(T)e"A r 



■ L r+| E a / 4 T (^ a ) 2 d*-E a Jf P a <iw; 



(54) 



In order to prove that T can be written as an expectation of the Malaney-Weinstein 
connection, we recall that 



E 



Ut)x,{t) 



a 



lim > 



- E * 



(ft 



lim E! 



st^o 



E 



dt 



dt 



dt 



e; 



(55) 



The last result of the theorem, Eq. (1431) . follows simply by making a small change in the 
portfolio nominals, and keeping the boundary conditions on V fixed. 

□ 

Note that the curvature of T is zero if and only if a A = 0, which is equivalent to the 
no-arbitrage condition. Moreover, the probability measure P* might not be unique, as the 
choice of (3 a in general is not. This also implies that a* is not unique in general. 

A special case of a self-financing portfolio is a portfolio containing just one base asset. 
Corollary 3.2: For all assets in the market model \x G M. 



X^t) = E* \x^T)e-^^* + ^ aAj ^ dt ' 



(56) 



In particular, under the no-arbitrage assumption a A = 0, we recover the classic Martingale 
pricing theorem: 

(57) 



X,{t) =E* X M (T)e-Ji a * dt ' 



In section 2.1 we derived a modified Black-Scholes equation for the case of three assets. 
Now we can use the result of Corollary 3.2 to prove a generalization of such equation. 
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Consider the following vector of assets 

X 



X Q ,X 1 ,...,X n ,<i> 1 (X,t),...,$ m (X,t) f . (5* 



We will label the components of this vector by X M , /i = 0, 1, . . . , n + m. Moreover, we assume 
that $j are smooth functions of the vector of underlying prices, X := [Xi, . . . ,X n ]\ and 
dX = X rdt describes a savings account with deterministic interest rate r. The functions 
t) describe a set of European-style contingent claims with final payoff $j(X(T),T) = 
fi(X(T)), for some fixed T >t. Finally, we need to assume that a A are either deterministic 
or some function of the underlying prices X. These assumptions ensure that the expectation 
values in the RHS of Eq. (j5B"|) are functions of X and t only, and so our assumption, $j = 
<&i(X,t), is self-consistent. Under these assumptions we can prove the following corollary. 

Corollary 3.3 (Modified Black-Scholes Equation): Under the assumptions given above, 
Corollary 3.2 implies that the European- style contingent claims $j ; % = l,...,m, obey the 
non-linear Black-Scholes equations 



dt$i + it( a * + Yl aAj f) X i d & ~ [ a * + aAj n + i) ^ + \ it tojkXjXkdjdkQi = 

j=l \ A J \ A J j,k=l 



(59) 

with terminal conditions $i(X,T) = fi(X). Moreover, J A = J A (X,t) are a basis for the 
null space M of the (n + m+l) x (n + m + 1) matrix Q with components = o^al, 
where af , i = 1, . . . ,n are the volatilities of the underlying securities, and we define <7q := 0, 

n 

<r a n+i ^^(J^logfi^t) , z = l,...,m, (60) 



and 



a = r 

A 



E^o. (61) 



Proof: The Eq. (|59|) of the corollary is a simple application of the Feynman-Kac theorem 
to Eqs. (156!) (see [ShOO] )). In order to calculate all components of the matrix Q, we remind 
the reader that the underlying prices X obey 

z = l,...,n. (62) 



dXi — Xa 



a 



j2* A Jf)dt+Y,<dw: 
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This implies that the stochastic part of J$j is given by 



d$i(X, t) = t) X A log %(X, t)a]dW: + ... . (63) 

3=1 

Therefore, the volatilities for the X n+i = securities are 

n 

oZ H = Y,<T*X J d j ]ogQ i (jt,t). (64) 

3=1 

Moreover, since X is a deterministic process, it follows that = 0. In order to prove Eq. 
(16 ip of the corollary, we recall that the savings account obey dX = rX Q dt. This implies 
that r = a* + ^ a ol a Jq. This completes the proof. 

□ 

The example of section 2.1 is a special case of the corollary 3.3, with n — m — 1. In this 
case there is only one null direction. We will use the notation X\ := X, $i := V, o\ := a±, 
and a 1 := a in what follows. A choice for the basis of the null space was given in Eq. (T20j) . 
which we repeat here for the convenience of the reader: 



7 = 



/2y/\ + XdX log V (XdX log V-1) 
It follows that Eq. (159|) becomes 



/l -X<9XlogV\ 
XSXlogV 

V - 1 / 



(65) 



= d t V+ [a* + aJi) Xd x V - (a* + a J 2 ) V + l -a\X 2 d x y 

= d t V + (r + a{J x - J )) X9 X V - (r + d( J 2 - J )) V + ^^X 2 9|V 
= ^V + r(X9XV- V) + ^?X 2 9|V 



+v / 2«V v / l + log V (XdX log V - 1) . (66) 
This is exactly what we obtained in section 2.1 (c.f. Eq. (1291) ). 

3.2 Relation to Farinelli Connection 

Before concluding this section, we would like to relate our connection T to another arbitrage 
connection proposed recently in [Far08] . |Far09a] and [Far09b] . We will show that the con- 
nection presented in |Far08j . |Far09aj and |Far09b] is equivalent to Eq. (1441) . Consider an 
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economy in the time interval t e [0, T], composed of some general "base assets" with prices 
Xi, i — 1, . . . , N, and synthetic zero-bonds on these assets. The price of a zero-bond (in the 
same units as XA can be defined as 



Zi(t,T) := Bi(t,T)Xi{t) , i = l,..., N 



(67) 



where B{(T,T) = 1 and B(t,T) > 0, if T > t. In other words, Zj pays one unit of asset 
Xi at maturity. Moreover, the dynamics of Bi(t,T) can be traced back to a T-independent 
stochastic, which we call the spot rate Ti. Explicitly, 



d 

ri(t) : = hm -—logBi(t,T) or equivalently Bi(t,T) := E t 

The following gauge connection was proposed for this economy^] 

J2i X (d(f>i - rifadt) 



(68) 



F 



"H &,, J — w — 



(69) 



Our goal is to relate F to our connection T, Eq. (144)) . restricted only to the base assets. 
In order to do this, we define a larger price vector with components X^, // = !,..., 2N: 



X(t) = (X 1 (t),..., X^Z^t, T),...,Z N (t, T)) t . 



(70) 



We now do the usual decomposition of dX^ as in Eqs. ([2]) and (j4j). Using the pricing formula, 
Eq. (1561) . we find that 



Bi(t,T) = E* 
= Et 
= Et 



e ~ f(«*+EA a A J l A +N )dt'+f t T dlogX l 



■ f[(«*+EA ^Jf'+N+T.ainVm'+Y.a P"^Va ~d tog Xi] 



■ It r ds)ds 



(71) 



where we have extracted the spot rate as 



(72) 



Moreover, since lim T ^t+ Zi(t,T) = Xi(t), and by assumption is independent of T, it must 



5 In |Far08j . |Far09aj and |Far09bj . Stratonovich calculus was used instead of Ito [VKr81j . Here the usual 
differentiation rules apply. 
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be that 



Ji+N ~ Ji ■ (73) 

Therefore, we can finally identify the spot rate as: 



A a a 

Inserting Eq. (I74p in fl69|) we obtain 

-dt + dA , (75) 



where 

dA = - I a* + ^ > ]{(3 a f I dt- > '« + dlog I > I . 



^ E ^ 2 ) rft " E ^ W <> + d lo S ( E ^ ) 

a J a \ i / 



Therefore, we conclude that the connection F is equivalent to T up to a sign and a gauge 
transformation. 



4 Measuring Arbitrage Curvature 

In this section we explain how to estimate the arbitrage parameters a A using financial data. 
Given the discussion in the previous section, measuring these parameters is equivalent to 
measuring the "curvature" of the market. Needless to say, one can do this for a subset of all 
instruments only, and there are many technical difficulties which we discuss below. 

Even though a A is a gauge invariant, it is still defined up to a rotation in the null spac^§ 
A/". Therefore, the basic idea is to measure the rotational and gauge invariant quantity 

E = £<«*>' >0. (77) 

where a A in the left hand side of this equation is expressed as 

Jt. dX, 



^ Y v 



The vectors J A must be calculated using an estimate for the quadratic variation Vt^ u : 



6 For notational simplicity, we will still use M for the null space of the particular market sub-sector under 
study. However, it is important to keep in mind that this is not the null space of the full market. 
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d( logX M , \ogX v ) /dt, and the results of Proposition 2.2. We introduce the notation A 2 := 
^2a(c( A ) 2 for the measurement of arbitrage curvature. A positive detection of A 2 can be 
translated into a self-financing arbitrage portfolio strategy using the result of the following 
proposition. 

Proposition 4.1 (Arbitrage Strategy): Let the asset corresponding to \x = be the 

numeraire (X := 1). If the market model satisfy the positive curvature assumption 

A 2 > 0, (79) 

then the portfolio allocation 

N - t 

i=l J ° A (g0) 

is a selfinancing arbitrage strategy delivering wealth 

V(t)= [ A 2 {s)ds. (81) 
Jo 

Proof: First, we check that the strategy is self-financing, that is 

N 

^( C /0 M X M + rf(0 M ,A M ))=O, 
where d denotes the Ito differential (see |LaLa06j . Chapter 4.1.2) 



(82) 



This is proved by the 
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following computation: 



N N 



+ d{(j>^ X„)) = d<f) X + d(<f) , X ) + E ( d & Xi + 

N 

= E E ^jfdXi/Xi + Y,d {a A j£) 



A i=l A 

N / \ N 



+ E x ^ d E « A J /V^ + E d < E aA Jt/Xi, x^ 



i=l \ A / i=l A 

iV 



= e^kt^ 

A \ fj,=0 

= 0. (83) 
Since the self-financing condition is fulfilled, the portfolio value can be computed as 



N t N 

v(t) = Y,MWt)= / ^)^)+E a "E^ 



=0 

^ jAf \ rt N jAf \ 

u A i=l n 7 u A n=0 ^ K ' 

= fy2a A {sfds= f A\s)ds. (84) 
Jo A Jo 

Since the arbitrage curvature are positive, we see that V(0) = and V(t) > for all times 
t G [0, T]. The proof is completed. 

□ 

Of course, there is no continuous time trading in the markets, and we can only do 
measurements in discrete time. Moreover, our estimate of Q will always include errors. This 
means that we will always have a noise term in the right hand side of Eq. ( 1771) . The goal of 
this section is to explain the basic steps used to measure arbitrage curvature, and understand 
the major sources of error in such measurements. A key aspect of our algorithm is that we 
test directly for the gauge invariance of the arbitrage signal. This allow us to check the 
robustness of our estimators. We find that the gauge invariance of the arbitrage signal, as 
predicted by the stochastic models, is indeed obeyed with good accuracy in the real market. 
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4.1 Basic Algorithm 



In what follows, we will use a hat in any variable which is an estimate of some parameter, 
e.g. Cl is an estimate for Q. The first problem we face is to find an estimate for the quadratic 
variation Q and to determine the null space M defined in section 2 (if non-trivial). This 
is a familiar problem in volatility modeling. Since we will never observe Q directly, it is 
expected that our estimate will not have any exact zero-mode, but only eigenvectors with 
small eigenvalues. In fact, a priori, we do not know if the space M is non-trivial. We can 
only guess its dimension. 

Let Cl be any estimate for Q. Then, following Proposition 2.2, we construct the matrix 

G = Cl - — (uCl + hu) + — Tr(uCl)U , (85) 
N V / N 2 

where N is the number of rows (or columns) of Q and U is the matrix of all ones: U^ v = 
1 , V/i, v. We can then use standard algorithms to compute the eigenspace of G. This will 
yield orthonormal eigenvectors 

GJ A = X A J A , (J A yj B = 8 AB , A,B = 7 1,...,N-1, (86) 

where X A > since G is positive semidefinite. As a matter of fact, since Q and U commute, 
the have a common basis of eigenvectors and a short computation proves that G has always 
(at least) one zero eigenvalue and the biggest N — 1 eigenvalues equal those of Q, which are 
not negative (c.f. Proposition EJ2). In practice, there will only be one exact zero eigenvec- 
tor: J° oc (1,1,..., 1)t. Summarizing, the eigenvalues of G ordered in increasing order of 
magnitude are, 

= A < A 1 < A 2 < . . . < A^ 1 . (87) 
It is easy to show (c.f. Proposition 2.2) that 

5^ = 0, for A=1,2,...,JV-1. (88) 

Our estimate for the basis of M will be to chose the first k eigenvectors with the smallest 
eigenvalues: J A , A = 1, . . . , k < N — 1. In doing this, we are assuming that dim (A/*) = k. 

Once, we have calculated J A , we can compute our estimate of a A in discrete time: 



J A (t) 



(89) 
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Note that J (t) is constructed with information up to time t only. This estimate is consistent 
with the non-anticipating nature of Ito integrals. The time step St is, of course, arbitrary. 
Our estimate for A 2 now becomes: 



A=l 



A=l 



(90) 



In the limit of short time scales, and if there is non-trivial arbitrage, we expect that this 
estimator converges to the true signal 

A 2 {t + St) = A 2 {t) + ^a A da A = A 2 {t) + 0{St) (£i->0). (91) 

A 

The convergence in Eq. fl9Tj) is only valid if in the limit St — > we have 

Et [a A {t + St)} - a A {t) = 0{St) , Cov 4 [a A {t + St), a B {t + St)] = 0(St) . (92) 

Therefore, we expect that, if there is non-trivial arbitrage in the market, the estimator (1901 
will give us a positive signal on average. Since the time scale is arbitrary, it is convenient to 
set St = 1 henceforth. 

There are several candidates for an estimator for Q. The "right" choice of Q should 
reflect our believes about the true dynamics of the asset values. Here we will simply take the 
empirical estimator for covariance of the time series of log returns for a window of length L. 
More precisely, our data consist of a number of time series for the prices X^, /i = 0, . . . , N — 1 
in certain units, say USE0. Our estimator reads 



L-l 



^u{t) 



X^t-i) 
XJt -i-l) 

L-l 



t5> 



log 



1 

1? 



5> 



i,j=0 



X^t-i) 
X„(t -i-l) 



X u (t - i) 
_X v {t-i-l)_ 

X v (t-j) 



log 



X v (t-j-l] 



(93) 



For more sophisticated estimators see |HaHaPi08l IZh06] . We are now in position to summa- 
rize the most basic algorithm to detect arbitrage. 
Algorithm: 

1. Starting with the time series for X^, \x = 0, . . . , N — 1 in an interval [t,t — L], we 
estimate fl^it) using Eq. (Tff5)). 



7 We also include the USD itself as an asset in which we have Xn = 1. 
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2. We then calculate the G matrix using Eq. (E2JI, and its orthonormal eigenspace. The 
eigenvectors will be labeled as J A , A = 0, 1, . . . , N — 1, in order of increasing eigenval- 
ues: = A < A 1 < . . . < A^" 1 . Moreover, J° oc (1, 1, . . . , l)t. 

3. Given a guess for the dimension of the null space k = dim{M), we take as its basis the 
following eigenvectors of G: J A , A = 1, . . . , k. 

4- We then calculate a A (t + 1) from Eq. / fffPj) . which uses information up to time t + 1. 

5. Roll the time window by one step, and repeat steps 1-5. Once we have more than one 
estimate for a A , we can calculate our final arbitrage estimator A 2 from Eq. (E2P- 

6. In order to explicitly check for gauge invariance, we repeat steps 1-5, using each asset 

as numeraire. For example, if we want to use X\ as numeraire, we divide all ele- 
ments of the time series by the corresponding element ofX\, e.g. X^s) — > X fl (s)/Xi(s) 
V/i and s G [t, t — L] . Then we repeat steps 1-5 with the new time series. Note that 
this is a non-trivial transformation in the data and, in practice, we will get different 
estimates for a A . 

Before discussing the results of the algorithm, we need to understand what are the main 
sources of error in our signal. This is done in the next subsection. 

4.2 Sources of Error 

The sources of error in our measurement of A 2 can be divided in three groups. First, there 
is gauge dependence. Second, there is a gauge invariant noise, which we will discuss below. 
Finally, when using high-frequency financial data, one is faced with the so-called market 
microstructure noise which is partly due to the bid/ask bounce effect |HaHaPi08] . 

We begin with looking at sources of gauge dependence. Note that our construction of the 
estimators assumes that, under a gauge transformation, Cl transforms like Q, c.f. Eq. (114j) . 
However, the gauge transformation rule in the real world can be quite different, because the 
unknown effective dynamics could lead to gauge dependences. We do not have an a priori 
test for this source of error. The only way to test for it is to make our calculations in different 
gauges and see how different the answers are. We will show examples of this in the next 
sections. 

The second source of error in our signals come from a gauge invariant noise term. In fact, 
we will see that this is the dominant noise contribution. In order to understand this noise, 
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it is convenient to discretize the Ito integral and write our estimate for a A as 



fj, M\ / 
ft, a 

:= a t t nd (t) + 6 A (t + l). (94) 
Here we have decomposed the signal in a trend 

<LdW := E + E J£(t)a^(t) , (95) 

and a stochastic noise term 

6 A (t + 1) := E + !) - W M , (96) 

with Ej [e^^+l)] = 0. Since J A is only an estimate for the real J A , we have that J A(J 1i 
in general. Therefore, our error in the estimate of J A will induce an extra noise term in the 
signal. Moreover, it will also induce some gauge dependency. To see this, note that under a 
change of numeraire, we have cr" — > er" + 5a a and (3 a — > (3 a + 5a a . It is then easy to check 
that the trend will transform as «^ cnd — > ot A vcnA + J2 fia J / A o'1 l 5a a . However, note that the 
noise term is gauge invariant. In fact, one expects the term Jfofi to be quite small. 
Moreover, since in our algorithm the gauge transformation is of the order 5a a = C?(cr"), we 
expect the gauge dependence coming from the trend to be negligible. We will see that, in 
real financial data, most of the signal can be accounted by the gauge invariant noise term. 
We are interested in estimating the size of the noise contribution. For that, we compute 
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the variance of the noise using information up to time t: 



Var t 



^a A {t)a A {t+l) 



E 



J2 &A ( f ) {a A (t+l)-E t [a A (t+l)]) 

A 

Y,& A (t)a B {t) {[J A (t^G(t)J B (t) 

A,B 

Yya A {t)?\ A {t) 

A 

+ ^2& A {t)a B {t) ([j A (t)] ] 5G{t)j B (t) 



A.B 



(97) 



where 5G = G — G. If we think that our estimate of G is good, we can neglect the 0(SG) 
term and approximate 



Var, 



A\t + 1) » J>^)] 2 A A (t) < 5>^(i)] 2 A fc (t) 



(98) 



where we remind the reader that k is our estimate for the dimension of TV, and the eigenvalues 
of G have been ordered so that A 1 < A 2 < . . . < X k . 

An interesting consequence of Eq. (1981) is that one can put a fundamental bound for the 
size of the arbitrage curvature, in order to be detectable. We have, 



A 2 > WVar 



A 2 



A 2 > A fe 



(99) 



This means that, in order to have a chance to detect arbitrage, one needs to find financial 
products whose time series are as correlated as possible, which implies a very small value of 
Afc- 

The third source of error is market microstructure noise. This effect is relevant in high 
frequency data, when the size of the price movements is comparable with the bid/ask spread. 
In order to model this noise, it is convenient to set X Q := 1 as our numeraire. The standard 
way of simulating this noise is to introduce an additional jump term rji(t) to the log prices 
Xi, % = 1, . . . , N. More precisely, the observed price is X, and it is given by, 



log Xi{t) = log Xi {t)+T]i{t) 



where X^ is the "true" Ito process, and for simplicity we assume 



(100) 



E[X iV:j } = , Efa] = , EfoiJfe] := r) 2 b. 



(101) 
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Moreover, the noise terms are uncorrelated between different times. One can then show that 
our estimator will be contaminated by an amount 



Jim E[^ 2 ] = E[„4 2 ] - 7 1- + O ( JL ) , (102) 



, 1 - + o(^ 

st^o 'St 2 \St 2 

where 7 := dim(jV) - E[(£ A j o) \- lt 

can be shown that 7 > 0. Therefore, we see that the 
microstructure noise leads a negative contribution to our estimation of A 2 . The absolute 
value of such contribution diverges as we move towards higher frequencies (St — > 0). 

One way of detecting the presence of microstructure noise is to note that 



lim E 



log (Xi(t + St)/Xi(t)^ log [Xi{t)/Xi{t - ft))] = -if < . (103) 



In other words, the microstructure noise induces a negative correlation between subsequent 
log returns. We find that this effect is quite pronounced for equity and futures data. However, 
for stock indices, the effect seems to be negligible. This is mainly due to the fact that the 
microstructure noise "averages out" between all the stocks in the index. 

There is an extra source of error which is intrinsic to the algorithm, but only if we use a 
rolling window in our estimation of J A . For example, suppose that we estimate J A (t) and 
then roll the window and estimate J A {t + 1). Even if the matrices G(i) and G(t + 1) are 
near, J A (t) and J A (t + 1) can differ by a large orthogonal transform. It can be just a sign 
flip f.i., since the eigenvalue equations are invariant under J A — > —J . However, suppose 
two eigenvalues are near to each other, i.e. Ai ~ A2. Then, any linear combination of J 1 and 
J 2 is also approximatively an eigenvector of G. In physics, this is known as the problem of 
degenerate perturbation theory (see f.i. |Sa94j ). More generally, we have that 

lim J A (t + 1) = V C AB J B (t) , (104) 

||G(t)-G(t+l)|H0 

where C is an orthogonal matrix, i.e. 

C^C = 1 . (105) 

The problem can be solved if we can determine C. If so, we can construct the "correct" 
eigenvectors J A (t + 1) := ^(0^)^ J B {t + 1) so that lim ||5( j |Ho J A (t + 1) = J A (t). An 
approximate solution for C is presented in appendix A. This is implemented in our numerical 
routines. 
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4.3 Simulated Data 



In this section we apply our algorithm to simulated financial data. For this, we study the 
simple log-normal random walk model with constant coefficients (c.f. Eq. (l2l)). The solution 
to the stochastic differential equation <^ is 



X lt (t)=X ll (Q)exp 



«m 2 



a=l / a=l 



(106) 



where B(t) := [B\(t), . . . , Bd(t)}^ is standard multivariate Brownian motion with 

E[B«(i)]=0, Cov[B a (t),B b {t)]=t6 ab , (107) 
for all a, b = 1, . . . , d. As usual, we decompose the trends as 

d N-l 

a fl = a+Y,^; + J2 aAj ^ ^ = <-]vE<- ( 108 ) 

a=l AeM u=0 

We begin with an example with N = 21 assets and d = 18 Brownian motions, which implies 
k = dim (A/") = 2. We take as a first asset a bank account with zero interest rate, and make 
it to our numeraire. This means that we choose, 

X :=l, a a = 0, a = 0, (109) 



which implies 



i=l a=l i=l A=l 

In figure 1 we show a particular simulation of the log prices, where we take /3 a , er", a A 
from uniform random distributions in the intervals, (3 a G [— 10~ 4 , 10~ 4 ], af G [— 10~ 3 , 10 -3 ] 
and ot A G [— 10™ 4 , 10~ 4 ]. The simulation was generated using Mathematica. The arbitrage 
detection algorithm was implemented in C++. Each price was taken at a time separation 
of At = 1 (arbitrary time unit). In this particular case we calculate Cl using the first 100 
prices of the time series. In other words, we do not use a moving window. The results with 
the moving window are very similar. 

Now suppose we assume (correctly) that we have k = dim (A/*) = 2. We then run the 
algorithm and find the signal shown in figure 2. The solid horizontal line at A 2 ~ 10~ 8 is 
the correct value of A 2 . Therefore, we see that we get an accurate estimate for the arbitrage 
curvature. Note that, as we discussed in the previous section, in our algorithm we compute 
A 2 using each of the different assets as numeraire. We include error bars showing the range 
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Figure 1: Simulation with 20 log-normal random walks. 

of values obtained using the different gauges. The results in this simulated sample are gauge 
invariant to such high accuracy that the error bars cannot be appreciated. 

In the previous section we discussed how the main source of error in our detection tech- 
nique can be related to the biggest eigenvalue A& of the set {Ai, . . . , A^}. This led to a gauge 
invariant noise term. In this simulated sample data, we find that A^ ~ 10 -21 , and so using 



Eq. (|98p we find, y Var[^4 2 ] ps Vl0~ 8 x 10~ 21 ~ 10" 15 . Therefore, this noise term is very 
small in this case. The fluctuations seen in figure 2 are an artifact of this particular model. 
To understand them, we can expand Eq. ( 11061) as 

^±a ^tp^(im t ..., mi) 

where 

H = \ (-tow + ^offiB a (l)B b (l)J . (112) 
It is easy to show that is gauge invariant and 

E[e M ] = 0, E[e^] = ^(^) 2 . (113) 

This extra noise term, e M , is the reason for the gauge invariant fluctuations in figure 2. The 
noise term vanishes, if we integrate dX^ using an infinite partition of the time interval, as it 
is assumed in Ito integrals. Of course, this is never possible in practice. Nevertheless, we see 
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that in this example, the extra noise is very small compared to the arbitrage parameter A 2 . 
In fact, we expect this noise to be very small in general since it is of order Var[e^] = 0((<t°) 4 ). 

It is interesting to see what happens if we assume the wrong number of zero modes. For 
example, in figure 3 we show what happens if we take k = 1. We see that we get a gauge 
dependent signal. Finally, in figure 4 we show what happens if we assume k = 3. In this 
case, the biggest eigenvalue is A*. ~ 10~ 8 . As the figure shows, most of the fluctuations are 
coming from the gauge invariant noise described in the previous section. To see this we have 
plotted the expected noise according to Eq. (1951) : 



noise±(t 



A 2 ± ^Var t [A 2 (t + 1)] 



A 2 ± 



\ 



j2(& A (t)n A (t) , (H4) 



A=l 



where A 2 ~ 10~ 8 is the true value of the arbitrage (which is also the mean of the signal). 
We see that this noise accounts for most of the fluctuations and it makes the true arbitrage 
signal almost undetectable. The main point we would like to make here is that the correct 
value of k can be estimated from the quality of the signal. 
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Figure 2: Result for the arbitrage detection algorithm applied to the simulated data in figure 1. 
Here we assume (correctly) k = 2 null directions. The solid horizontal line at A 2 ~ 10~ 8 is the 
correct value of A 2 . The red solid line is the USD value of the signal. 

We can now investigate the effect of the market microstructure noise discussed in the 
previous section. In order to do this, we include additional white noise terms to the price 
processes of Eq. (11061) as described in the previous sub-section (c.f. Eq. ( llOOp ). In this 
particular example we choose the variance to be rj 2 = 10~ 5 . We then apply the noise to 
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Figure 3: Result for the arbitrage detection algorithm applied to the simulated data in figure 1. 
Assuming k = 1 null directions (incorrectly), and using a fixed window of 100 time steps. The 
error bars give the range of values obtained using the different gauges. The solid dots are the mean 
of all results. The red line is the USD signal. 

the data of the previous example. In figure 4 we show the result of the estimate of A 2 for 
the contaminated data. In this case we assume (correctly) that the dimension of the null 
space is k = 2. We can clearly see how the signal is now negative on average, due to the 
microstructure noise. This matches the theoretical prediction in Eq. (11021) . In figure 6 we 
plot the product of subsequent log returns according to (c.f. Eq. (11031) ) 

:= -JF^i E lo § + i)/^)) lo § [Mt)/Mt - 1)) , (us) 

i=l 

where X t are the contaminated prices. According to Eq. (11031) of the previous sub-section, 
we should have E[r} 2 ] = rf. This is precisely what we observe in figure 6. In the next 
subsection, we will see that such signals are very typical of high frequency security prices. 

4.4 Market Data 

In this section we present some examples of our arbitrage detection algorithm applied to 
real financial data. We begin with a look at three major US stock indexes: the Dow Jones 
Composite Average (DJA), the NASDAQ Composite Index (IXIC) and the NYSE Composite 
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Figure 4: Result for the arbitrage detection algorithm applied to the simulated data in figure 1. 
Assuming k = 3 null directions (incorrectly), and using a fixed window of 100 time steps. The 
green lines are the noise terms noise± estimated according to Eq. The error bars give the 

range of values obtained using the different gauges. The solid dots are the mean of all results. The 
red line is the USD signal. 

Index (NYA). Due to their similar nature, we expect strong correlations between these 
indexes. Our first sample consist of daily closing prices from September 1 2004 to July 
16 2009, a total of 1227 data points. The gauge invariant matrix G has been estimated 
using a moving window of 500 days. We have found the following values for the eigenvalues: 
Ai ~ 2 x 10~ 3 , A2 ~ 5 x 10~ 3 , A3 ~ 2 x 10~ 2 . Therefore, it is reasonable to assume that the 
null space has only one dimension, k = 1. The result of the arbitrage detection algorithm 
is shown in figure 7. We can see that the signal is indeed gauge invariant to a very good 
accuracy. In figure 7 we have also included an estimate for the gauge invariant noise term 
described in section 4.2. In this case we have assumed that the average of the signal is zero 
(i.e. no-arbitrage), and so our estimate for the expected noise is 

k 

j2^ A (twut) . (He) 

A=l 

Looking at figure 7, we see that the noise can explain most of the signal. Therefore, we find 
that our results are consistent with A 2 = 0, and hence no arbitrage. In figure 8 we show a 
histogram of the different values of A 2 . As pointed out above, the signal is consistent with 



Vzr t [A 2 (t + 1)] = ±^ 
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Figure 5: Result for the arbitrage detection algorithm applied to the simulated data in figure 1, now 
including micro structure noise with variance rj 2 = 10 -5 . Here we assume (correctly) k = 2 null 
directions. 



A 2 = since the signal to noise ratio is very low: E[^4 2 ]/-^/Var[^4 2 ] m —0.0709. 

It is very instructive to look at the trading strategy exploiting the arbitrage discussed in 
PropositionHll. In discrete time, the initial value of this portfolio is V(0) = ^ M (O)X M (O) = 
and the value at time t is simply 

v(t) = EE^)») ^ + A"i x,l(i) = £^ 2(s + 1} • (117) 

s=0 A,n ^ ' s=0 

In figure 9 we show the value of this portfolio for the daily data of the three US indexes. 
We include the integrated profit and losses of the indexes themselves for comparison. We 
have multiplied the index signals by a numerical factor so that it fits in the same picture. 
Therefore, the overall scale in the y-axis is irrelevant. We can see that, as expected, that the 
performance of this portfolio is very poor for such low frequency data. 

Next we look at the same index set (DJA, IXIC, NYA), but now at short time scales. As 
an example, we study high frequency data obtained on July 28, 2009. The data points have 
7 to 10 seconds in separation. The data was collected using the "FinancialData" package of 
Mathematica. The gauge invariant matrix G has been estimated using a moving window of 
500 data points. We have also assumed one null direction (k = 1). A sample of the arbitrage 
detection algorithm is shown in figure 10. It is quite obvious from this figure that the signal 
has a very significant positive skewness. In fact, a prominent feature of the signal is a series 
of positive peaks. These transient events have a duration of the order of 5-10 time steps, 
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Figure 6: Product of subsequent log returns in the presence of micro structure noise, according to 
Eq. h!15\l . The average of this signal is equal to rj 2 = 10~ 5 (horizontal line) in agreement with the 
theoretical prediction. 

which for this data is about one minute. The amplitude of the peaks is quite significant 
compared to the noise. We argue that these peaks are precisely temporary fluctuations 
with A 2 7^ 0, that is, non-zero curvature events in the market. To show that these are not 
isolated events, figure 11 shows the histogram for the full data sample. We can see significant 
positive skewness in the signal, compared to the daily data (c.f. figure 7). In fact, we find a 

significant signal to noise ratio: E[*4. 2 ]/ y^Var[y4 2 ] ~ 0.32. The integrated profit and losses of 
the arbitrage porfolio of Eq. (11 17ft is shown in figure 12. We can see a very good performance 
in comparison with the daily data (c.f. figure 9). Because of model risk, such portfolio can 
indeed have a finite probability of a loss on short time scales. However, we see that on longer 
scales (integrated signal), the probability of a loss goes to zero asymptotically as t — > oo. 
This is an example of a statistical arbitrage as discussed in |Po07j IBo03j . 

We have also studied the effect of the microstructure noise on the high frequency signal. 
In particular, we have computed the estimate of the noise fj 2 defined in Eq. ( 11151) . We have 
found that for this particular data sample, the contribution from such noise is very low: 
E[r} 2 ]/ ^/Var [fj 2 ] « 0.04. The effect becomes quite significant, however, if we look at traded 
assets such as stocks and futures. 

Our next data sample consist of the following set of US index futures: E-Mini S&P 500 
(ESU09.CME), DJIA mini-sized (YMU09.CBT), E-Mini Nasdaq 100 (NQU09.CME) and 
S&P 500 Index Future (SPU09.CME). The data was collected on August 9, 2009, and all 
futures expire on September 2009. We have collected prices with a frequency of 7-10 seconds 
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Figure 7: Arbitrage detection algorithm applied to daily closing prices of 3 major US indexes: DJA, 
IXIC and NYA. Here we show a sample of 500 data points. The green lines are an estimate of 
the variance of the gauge invariant noise using Eq. $116\) . The error bars give the range of values 
obtained using the different gauges. The solid dots are the mean of all results. The red line is the 
USD signal. 

separation, using the "FinancialData" package of Mathematica. These securities are highly 
correlated. Therefore, they are ideal for the search of the arbitrage signal. However, since 
these are traded instruments, the effect of the bid/ask spread is more pronounced. In figure 13 
we show the histogram of the values of A 2 obtained by applying exactly the same algorithm 
as in the previous example. We get a very poor signal; in fact, we get a negative mean: 
E[^4 2 ]/ y^Var^ 2 ] « —0.061. The integrated profit and losses of the simple portfolio of Eq. 
(11171) are shown in figure 14. 

We have also calculated the effect of the microstructure noise, by computing the estimate 
fj 2 defined in Eq. (11151) . The effect for this data sample is about an order of magnitude bigger 
than the previous example: E[^ 2 ]/y / Var~j?p] « 0.15. This noise is the main obstacle to a 
detection of A 2 . Nevertheless, one can devise more complicated detection methods which 
filter out the microstructure noise. In figure 14 we show the integrated profit and losses of 
a particular proprietary strategy. A detailed study of such strategies is beyond the scope of 
this paper. 
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Figure 8: Histogram of different values of A 2 obtained using daily market data for DJA, IXIC and 
NYA. The signal to noise ratio is: E[A 2 ]/J Var[A 2 ] « -0.0709. 



5 Conclusions 



In this paper we have defined a general measure of arbitrage which is invariant under changes 
of numeraire and equivalent probability measure. Our main assumption is that all financial 
instruments can be described by Ito processes. This is not a very strong assumption as 
many complex financial models, including those reflecting the non-Gaussian nature of stock 
returns, can be modeled this way. We showed that the gauge invariant arbitrage measure can 
be interpreted in terms of the curvature of the stochastic version of the Malaney-Weinstein 
connection ( [Ma96j . |We06j ). The zero curvature condition is then equivalent to the no- 
arbitrage principle. Moreover, we demonstrated a simple generalization of the classic asset 
pricing theorem to include arbitrage. Finally, we have presented a basic algorithm to mea- 
sure the market curvature using financial data. We found evidence for non-zero curvature 
fluctuations in high frequency data involving stock indexes and index futures. 

From a financial perspective, our algorithms can be used to exploit arbitrage system- 
atically, and generate profitable trading strategies. This will require much more empirical 
research, and the development of more sophisticated techniques to estimate the arbitrage 
curvature measure A 2 . This is left for future work. 

From a scientific perspective, we believe that our findings represent a modest step towards 
the understanding of the non-equilibrium market dynamics. Gauge theories provide the 
natural mathematical language to that aim, and arbitrage opportunities can be interpreted 
as a non-zero curvature fluctuation in an economy out of equilibrium. It is interesting how 
most of our current economic and financial thinking rely so much on the assumptions of 
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Figure 9: Integrated profit and losses of the arbitrage portfolio (red line) for the daily data of the 
US stock indexes DJA, IXIC and NYA. We also show the integrated profit and losses of the indexes 
themselves. 

general equilibrium theory. 

There has been a growing consensus that we need a better understanding of the non- 
equilibrium dynamics of the economy (see e.g. |FaGe08j ). In particular, one would like to 
understand what is the relaxation time scale for non-equilibrium fluctuations to disappear (if 
they do). Within the limited data sample that we have shown in this paper, the relaxation 
time seems to be of the order of one minute. However, this can be very different in other 
sectors of the market. 
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Figure 10: Arbitrage detection algorithm applied to high frequency data for 3 major US indexes: 
DJA, IXIC and NYA. Here we show a sample of 100 data points. The green lines are an estimate 
of the variance of the gauge invariant noise using Eq. $116\) . The error bars give the range of 
values obtained using the different gauges. The solid dots are the mean of all results. The red line 
is the USD signal. 

Appendix A 

In this appendix we consider the following general mathematical problem. Suppose we are 
given two NxN symmetric and real matrices which we label by G(t) and G(t+1) respectively. 
Moreover, both matrices have a null space with the same dimensionality. In other words, let 
J A {t) and J A (t + 1) be orthonormal basis vectors such that 

G(t)J A (t) = 0, A=l,...,k, G(t + 1) J B (t + 1) = , B = l,...,tf, (118) 

and k = k' . Now let || • || be some matrix norm such that — G(t + 1)|| <C 1. More 

precisely, we are interested in the limit in which both matrices are very similar to each other. 
We then wish to find a k x k rotation matrix C such that 



In physics, this problem arises in the so-called degenerate perturbation theory |Sa94] . How- 
ever, it is often assumed that G(t + 1) will "lift the degeneracy", which in our case means 



lim 

||G(t)-G(*+l)|H0 





(119) 



B 



38 




-l.xl(r s 5.x 10 



5.x 10"" 1.X10- 8 1.5x10-* 2.XKT* 2.5x10-' 



Figure 11: Histogram of A 2 obtained using high frequency data for DJA, IXIC and NYA. The 
signal to noise ratio is: K[A 2 ]/\/ Var[A 2 ] ~ 0.32. 



k' < k. This is not the case in our problem, and so the usual perturbation theory methods 
do not work. 

We are interested in solving the problem above only approximately. In fact, in practice we 
have no control over G(t) and G(t + 1), and so we cannot take the limit || £?(£) — G(t + l)|| — > 
explicitly. However, we will assume that — G(t + 1)|| «C 1 (which is true most of the 

time). Now consider the k x k real matrix C with components 



C 



AB 



[j A (t+l)]' J B (t) 



(120) 



Under our assumptions, we have 



c ab ^ c ab + Q ( 5G j 



~<AB 



(121) 



where 5G — G(t) — G(t + 1). Therefore, we seek a rotation matrix C which approximates C 
as closely as possible. In the limit ||G(t) — G(t + 1)|| — ^ we should have C — >• C . In other 
words, C will be the our best approximation to C. 

We can solve this problem using Lagrange multipliers. We seek to minimize the La- 
grangian 



C= Tr 



(c - cy {c - c) 



Tr 



A (C T C — 1) 



(122) 



where A is a symmetric matrix which serves as a Lagrange multiplier implementing the 
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Figure 12: Integrated profit and losses of the arbitrage portfolio (red line) for the high frequency 
data of the US stock indexes DJA, IXIC and NYA. We also show the integrated profit and losses 
of the indexes themselves. 



constraint C T C = 1. The Lagrange equations are then 

8C o, J4 = o 



dC AB 



d\ AB 



(123) 



We will only present the solution to Eqs. (I123p for the cases of k = 1 and k = 2. 

For k — 1, the rotation matrix C is simply a sign and one can show that the minimum 
of C is 

C = sign(C) . (124) 
For k = 2, the solution can be written in terms of the Pauli matrices: 



cr := 



1 
1 



1 

1 



a 2 



1 
-1 



o"3 



1 
-1 



(125) 



We can expand, 



where 



i=0 



i=0 



i=0 



Ci = - Tr(aJ C) , i = 0,...,3, 



(126) 



(127) 
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Figure 13: Histogram of A 2 obtained using high frequency data for the index futures: ESU09.CME, 



YMU09.CBT, NQU09.CME, SPU09.CME. The signal to noise ratio is: E[A 2 ]/y Var[A 2 } « 
—0.061. This picture illustrates the negative effects of the market microstructure noise for the 
simplest trading strategy. 



and similarly for c\ and \. The solution to the Lagrange equations, Eq. f)123j) . which 
correspond to the minimum of C can be shown to be 



Co = c 3 = , ci 



Cl 



~c\ + cl 



c 2 



c 2 



c\ + ~cl 



(128) 



for c 2 + c 2 .>c 2 l + c§, and 



ci = c 2 = , c 



c 



c 3 



c 3 



cg + cl 



(129) 



otherwise. 
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